WiDS: An Integrated Toolkit for Distributed System Development

نویسندگان

  • Shiding Lin
  • Aimin Pan
  • Zheng Zhang
  • Rui Guo
  • Zhenyu Guo
چکیده

Faced with a proliferation of distributed systems in research and production groups, we have devised the WiDS ecosystem of technologies to optimize the development and testing process for such systems. WiDS optimizes the process of developing an algorithm, testing its correctness in a debuggable environment, and testing its behavior at large scales in a distributed simulation. We have developed many distributed protocols and systems using WiDS, including a large-scale backup service that is robust enough to be deployed. We have also used WiDS to perform ultralarge scale (>1million instances) simulation of a production protocol. In this paper, we describe the principles and design of WiDS, share the lessons that we learned, and discuss on-going research that will further reduce programming and debugging difficulties of distributed systems. † Work is done as intern in Microsoft Research Asia.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

WiDS Checker: Combating Bugs in Distributed Systems

Despite many efforts, the predominant practice of debugging a distributed system is still printf-based log mining, which is both tedious and error-prone. In this paper, we present WiDS Checker, a unified framework that can check distributed systems through both simulation and reproduced runs from real deployment. All instances of a distributed system can be executed within one simulation proces...

متن کامل

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...

متن کامل

ZEUS: A Toolkit for Building Distributed Multiagent Systems

The multi-agent systems approach of knowledge level co-operation between autonomous agents promises significant benefits to distributed systems engineering, such as enhanced interoperability, scalability and reconfigurability. However, thus far, because of the innate difficulty of constructing multi-agent systems, this promise has been largely unrealised. Hence, there is an emerging desire amon...

متن کامل

BioGrid Application Toolkit: a Grid-based Problem Solving Environment Tool for Biomedical Data Analysis

In this paper, we describe a Problem Solving Environment (PSE) tool called BioGrid Application Toolkit and its architecture over a grid environment. BioGrid Application Toolkit is a general purpose grid access tool that provides a parallel and distributed programming environment; it provides an efficient web-based user interface that allows users to develop, run and visualize parallel/distribut...

متن کامل

RSVC: A Reliable Distributed Control Software Development Toolkit

Modern large accelerator control systems consist of communication software and distributed services that must be reliable, eecient and exible. These requirements motivate the use of object-oriented technology to develop a toolkit for building reliable and fault-tolerant distributed software. This paper describes a C ++ toolkit (RSVC) that reduces eeort of development of reliable distributed con...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005